Co-regularization Based Analysis of Feature Sharing Algorithms

نویسندگان

  • Abhishek Kumar
  • Avishek Saha
  • Hal Daumé
  • Tom Fletcher
  • Suresh Venkatasubramanian
چکیده

A common approach in domain adaptation (DA) [1] and multitask learning (MTL) [2] is to create an expanded feature representation by sharing features across domains (in DA) or across tasks (in MTL) and then learning a classifier over this expanded feature set. In this paper, we refer to such techniques as feature sharing algorithms (FSA). One such FSA is EASYADAPT [1], which takes each feature in the original problem and replicates it three times: general, source-specific and target-specific. In this extended feature space the source data will contain general and sourcespecific features whereas the target data will contain general and target-specific features. EASYADAPT is simple, easy to implement as a preprocessing step and outperforms many existing techniques [1, 3], namely, SOURCEONLY (target hypothesis trained on labeled source data only), TARGETONLY (target hypothesis trained on labeled target data only), ALL (combination of source and target labeled data) and PRIOR (target hypothesis trained with SOURCEONLY as a prior on the weight vector) [4]. However, a theoretical analysis of why EASYADAPT performs better than the other aforementioned approaches is clearly missing. In this abstract, we present such an analysis. In order to achieve our goal, we model EASYADAPT in terms of co-regularization. This is an idea that originated in the context of multiview learning and for which there exists some theoretical analysis [5]. We denote source and target empirical errors for some hypothesis h as ǫ̂s(h) and ǫ̂t(h) and the corresponding expected errors as ǫs(h) and ǫt(h). PRIOR and EASYADAPT optimize the following cost functions:

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Second-Order Statistical Texture Representation of Asphalt Pavement Distress Images Based on Local Binary Pattern in Spatial and Wavelet Domain

Assessment of pavement distresses is one of the important parts of pavement management systems to adopt the most effective road maintenance strategy. In the last decade, extensive studies have been done to develop automated systems for pavement distress processing based on machine vision techniques. One of the most important structural components of computer vision is the feature extraction met...

متن کامل

Exploiting Task-Feature Co-Clusters in Multi-Task Learning

In multi-task learning, multiple related tasks are considered simultaneously, with the goal to improve the generalization performance by utilizing the intrinsic sharing of information across tasks. This paper presents a multitask learning approach by modeling the task-feature relationships. Specifically, instead of assuming that similar tasks have similar weights on all the features, we start w...

متن کامل

Improving of Feature Selection in Speech Emotion Recognition Based-on Hybrid Evolutionary Algorithms

One of the important issues in speech emotion recognizing is selecting of appropriate feature sets in order to improve the detection rate and classification accuracy. In last studies researchers tried to select the appropriate features for classification by using the selecting and reducing the space of features methods, such as the Fisher and PCA. In this research, a hybrid evolutionary algorit...

متن کامل

Sequential and Mixed Genetic Algorithm and Learning Automata (SGALA, MGALA) for Feature Selection in QSAR

Feature selection is of great importance in Quantitative Structure-Activity Relationship (QSAR) analysis. This problem has been solved using some meta-heuristic algorithms such as: GA, PSO, ACO, SA and so on. In this work two novel hybrid meta-heuristic algorithms i.e. Sequential GA and LA (SGALA) and Mixed GA and LA (MGALA), which are based on Genetic algorithm and learning automata for QSAR f...

متن کامل

Sequential and Mixed Genetic Algorithm and Learning Automata (SGALA, MGALA) for Feature Selection in QSAR

Feature selection is of great importance in Quantitative Structure-Activity Relationship (QSAR) analysis. This problem has been solved using some meta-heuristic algorithms such as: GA, PSO, ACO, SA and so on. In this work two novel hybrid meta-heuristic algorithms i.e. Sequential GA and LA (SGALA) and Mixed GA and LA (MGALA), which are based on Genetic algorithm and learning automata for QSAR f...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010